Visualizing Profiles of Large Datasets of Weighted and Mixed Data

نویسندگان

چکیده

This work provides a procedure with which to construct and visualize profiles, i.e., groups of individuals similar characteristics, for weighted mixed data by combining two classical multivariate techniques, multidimensional scaling (MDS) the k-prototypes clustering algorithm. The well-known drawback MDS in large datasets is circumvented selecting small random sample dataset, whose are clustered means an adapted version algorithm mapped via MDS. Gower’s interpolation formula used project remaining onto previous configuration. In all process, distance measure proximity between individuals. methodology illustrated on real obtained from Survey Health, Ageing Retirement Europe (SHARE), was carried out 19 countries represents over 124 million aged Europe. performance method evaluated through simulation study, results point that new proposal solves high computational cost low error.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MultiLayerMatrix: Visualizing Large Taxonomic Datasets

Adjacency matrices can be a useful way to visualize dense networks. However, they do not scale well as the network size increases due to limited screen space, especially when the number of rows and columns exceeds the pixel height and width of the screen. We introduce a new scalable technique, MultiLayerMatrix, to visualize very large matrices by breaking them into multiple layers. In our techn...

متن کامل

Visualizing Large Datasets in TOPCAT v4

Abstract. TOPCAT is a widely used desktop application for manipulation of astronomical catalogs and other tables, which has long provided fast interactive visualization features including 1, 2 and 3-d plots, multiple datasets, linked views, color coding, transparency and more. In Version 4 a new plotting library has been written from scratch to deliver new and enhanced visualization capabilitie...

متن کامل

Clustering Large Datasets of Mixed Units

In the paper we propose an approach for clustering large datasets of mixed units based on representation of clusters by distributions of values of variables over a cluster – histograms, that are compatible with merging of clusters. The proposed representation can be used also for clustering symbolic data. On the basis of this representation the adapted versions of leaders method and adding meth...

متن کامل

Visualizing and Analyzing Large and Detailed 3d Datasets

This paper presents a set of tools developed to model, visualize and analyze large 3D datasets built from 2D and 3D sensor data. These tools, grouped under the Atelier3D framework, first include a technique for processing and interactively visualizing datasets made of hundreds of millions 3D samples and tens of gigabytes of texture from digital photography. It also includes various analysis too...

متن کامل

Visualizing Large Datasets of Images in Web Analytics

Web analytics tools fail to report information about rich media elements like images and videos. The usage of rich media elements on websites is increasing exponentially over the time. In this paper, we introduce new visualization technique to visualize rich media elements as a part of web analytics framework. We’ve used data from recent user studies conducted on our image intensive mobile appl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2021

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math9080891